arXiv:1509.00351v1 [stat.ME] 1 Sep 2015


Defining and estimating causal direct and 
indirect effects when setting the mediator to 
specific values is not feasible 

JUDITH J. LOK* 

Department of Biostatistics, 

Harvard T.H. Chan School of Public Health, 

655 Huntington Avenue, Boston, MA 02115, USA; 
jlok@hsph.harvard.edu 

September 2, 2015 


Abstract 

Natural direct and indirect effects decompose the effect of a treatment 
into the part that is mediated by a covariate (the mediator) and the part that 
is not. Their definitions rely on the concept of outcomes under treatment 
with the mediator “set” to its value without treatment. Typically, the
mechanism through which the mediator is set to this value is left unspecified,
and in many applications it may be challenging to fix the mediator to particular
values for each unit or individual. Moreover, how one sets the mediator may
affect the distribution of the outcome. This article introduces “organic”
direct and indirect effects, which can be defined and estimated without relying
on setting the mediator to specific values. Organic direct and indirect
effects can be applied, for example, to estimate how much of the effect of some
treatments for HIV/AIDS on mother-to-child transmission of HIV-infection
is mediated by the effect of the treatment on the HIV viral load in the blood
of the mother.

Causal inference, Direct and indirect effect, HIV/AIDS, Mediation, Observational 
study, Organic direct and indirect effect. 


1 Introduction 


Researchers are often interested in investigating the mechanisms behind effective
treatments or exposures. The question of which part of the effect of a treatment
is “mediated” by a covariate is of particular importance. The mediated part of a
treatment effect is due to treatment-induced changes in a “mediator” covariate
M. This is the so-called indirect effect, as opposed to the so-called direct
effect, which is not mediated by covariate M. Mediation analysis has gained much
prominence in methodological and empirical research in recent years. It is
particularly popular in the health sciences, such as epidemiology and psychology.
In June 2015, Baron and Kenny (1986) had over 52,000 citations in Google Scholar,
many of them after 2010. Therefore, clarifying the assumptions required for
mediation analysis is paramount.


On the definition and estimation of causal direct and indirect effects, see e.g.
Robins and Greenland (1992), Pearl (2001, 2011), Imai and others (2010),
VanderWeele (2009), Robins and Richardson (2010), and Tchetgen-Tchetgen (2011).
In this literature, the controlled direct effect is the effect of a treatment
had the mediator been set to a pre-determined value. The natural direct effect
is the effect of a treatment had the mediator been set to the value that it
would have taken without treatment. Thus, the natural direct effect for a
particular unit or patient can be represented by $Y_{1,M_0} - Y_0$: the
difference between the unit’s outcome $Y_{1,M_0}$ under treatment had the
mediator been set to the value $M_0$ it would have taken without treatment and
the unit’s outcome $Y_0$ without treatment. Notice that the mediator is kept
constant at $M_0$, so this is the natural direct effect not mediated by M.
Similarly, the natural indirect effect can be represented by
$Y_1 - Y_{1,M_0}$, where $Y_1$ is the outcome under treatment. It has been
argued that to study the causal mechanisms by which particular treatments
are effective, natural direct and indirect effects are more relevant than
controlled direct effects (Pearl (2001), VanderWeele (2009)). Pearl (2001) also
notes that controlled indirect effects are not defined.
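The unit-level definitions above can be made concrete with a toy structural model. The following is a minimal sketch, assuming a hypothetical data-generating mechanism in which every potential mediator and every potential outcome $Y_{a,m}$ is known; the function names and all coefficients are illustrative assumptions, and with real data the cross-worlds quantity $Y_{1,M_0}$ is never observed.

```python
# Toy structural model; all coefficients are illustrative assumptions.
def mediator(a, u):
    # Potential mediator M_a for a unit with latent factor u.
    return 1.0 + 2.0 * a + u

def outcome(a, m, u):
    # Potential outcome Y_{a,m}: treatment a with the mediator set to m.
    return 0.5 + 1.5 * a + 0.8 * m + 0.3 * u

def natural_effects(u):
    m0, m1 = mediator(0, u), mediator(1, u)
    y0 = outcome(0, m0, u)       # Y_0
    y1 = outcome(1, m1, u)       # Y_1
    y1_m0 = outcome(1, m0, u)    # cross-worlds quantity Y_{1,M_0}
    direct = y1_m0 - y0          # natural direct effect
    indirect = y1 - y1_m0        # natural indirect effect
    return direct, indirect, y1 - y0

direct, indirect, total = natural_effects(u=0.4)
```

In this toy model the two effects add up to the total effect for every unit, because $Y_{1,M_0}$ cancels in the sum.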

As seen from their definition, natural direct and indirect effects require
“cross-worlds” quantities. In particular, $Y_{1,M_0}$ is the outcome under
treatment but with the mediator set to the value, $M_0$, it would have taken
without treatment. In practice, it may be rare that mediators take the same
value with treatment, $M_1$, as without treatment, $M_0$. Even if this happened
for specific sample units (e.g., patients; from now on: units), it would be
impossible to identify those units. Under treatment, the value of the mediator
without treatment is not observed, so it is unclear to which value the mediator
should be set for any particular unit. In addition, identification of natural
direct and indirect effects relies on assumptions about the outcomes $Y_{a,m}$
of a unit under all combinations of the treatment and the mediator (Pearl (2001,
2011), VanderWeele (2009), Imai and others (2010), Tchetgen-Tchetgen (2011)).
This is a problem in many practical applications: setting the mediator to
particular values is often not feasible if the mediator is not a treatment
itself. If setting the mediator to a particular value m is not feasible, the
interpretation of the $Y_{a,m}$ is unclear. However, the common approaches to
causal mediation analysis all rely on the existence of all $Y_{a,m}$ (Robins and
Richardson (2010)).

To overcome these problems, this article proposes instead to base mediation
analysis on newly defined “organic” interventions (I) on the mediator. Organic
interventions I cause the mediator to have a specific distribution: the
distribution of the mediator without treatment, given pre-treatment common
causes of mediator and outcome. An organic intervention could be an additional
treatment that affects the distribution of the mediator. Theorem 4.4 shows that
organic direct and indirect effects are often generalizations of natural direct
and indirect effects and of the direct and indirect effects introduced in
Didelez and others (2006). Like the current article, Geneletti (2007) considers
interventions on the mediator, but does not condition on common causes C of
mediator and outcome, unless the interest is in effects conditional on C or in
effects when C is manipulated. Robins and Greenland (1992), Pearl (2001), and,
for organic interventions, Section 4 argue that ignoring C can produce invalid
estimators. Section 5, on uniqueness of organic direct and indirect effects,
also requires that C is taken into account.

Most of the causal inference literature on mediation has adopted the so-called
cross-worlds assumption, an assumption involving the joint distribution of
counterfactuals under different values of the treatment (Pearl (2001, 2011),
VanderWeele (2009), Imai and others (2010), Tchetgen-Tchetgen (2011)). An issue
with this assumption is that it can never be tested or imposed by design, not
even in a clinical trial where the treatment and the mediator can both be set to
any desired value by the experimenter. In contrast, whether “setting the
mediator” is an organic intervention can be tested in such a clinical trial.

The theory in this article turns out to lead to the same numerical results as
introduced by other authors: Baron and Kenny (1986) and, more recently, e.g.
Pearl (2001, 2011), VanderWeele (2009), Imai and others (2010),
Tchetgen-Tchetgen and Shpitser (2012), and Didelez and others (2006). This
article thus provides an interpretation of existing numerical results when the
mediator cannot be set.

Proofs can be found in Web-appendix A. I illustrate the usefulness of organic
direct and indirect effects as opposed to natural direct and indirect effects in
two examples: 1. the smoking and low birth weight paradox, see Web-appendix B,
and 2. the effect of AZT, a drug used for the treatment of HIV, on
mother-to-child transmission of HIV-infection. I investigate how much of the
effect of AZT is mediated by the HIV viral load, the amount of HIV-virus in the
blood of the mother.


2 Setting and notation 


For ease of exposition, I first consider randomized treatments. Section 7 extends
the analysis to non-randomized treatments. For each unit, observables include the
following quantities. A is the randomized treatment, which is 1 for the treated and
0 for the untreated. C are pre-treatment common causes of the mediator and the
outcome. As noted by Robins and Greenland (1992), Pearl (2001), and others,
these variables C have to be taken into account in order to identify the natural
direct and indirect effect, even if the treatment is randomized. As will become
clear later, pre-treatment common causes C also have to be taken into account
to identify the organic direct and indirect effect. Like most of the literature on
mediation, this article assumes that there are no post-treatment common causes of
the mediator and the outcome. M is the observed value of the mediator. Y is the
observed outcome. $Y_0$ is the (counterfactual) outcome without treatment, and
$Y_1$ is the (counterfactual) outcome under treatment. Obviously, for each unit
either $Y_1$ or $Y_0$ is observed, but not both. Similarly, $M_0$ is the mediator
without treatment, and $M_1$ is the mediator under treatment. I assume that C is
observed first, then A, then M, and then Y. The Directed Acyclic Graph (DAG) of
Figure 1 describes the set-up. Throughout this article, it is assumed that
observations and counterfactuals for the different units are independent and
identically distributed.


3 Natural direct and indirect effects: an overview 


Natural direct and indirect effects, introduced in Robins and Greenland (1992) and
Pearl (2001), are based on the outcome $Y_{1,M_0}$ under treatment with the
mediator set to the value it would have taken without treatment, $M_0$. The
natural direct effect is defined as $Y_{1,M_0} - Y_0$. The natural direct effect
is not affected by changes in the value of the mediator induced by the treatment,
because for both $Y_{1,M_0}$ and $Y_0$ the mediator is equal to $M_0$. The natural
indirect effect is defined as $Y_1 - Y_{1,M_0}$. This is the mediated part: the
only difference between these quantities is the change in the value of the
mediator, $M_1$ versus $M_0$.


Pearl (2001, 2011), Robins and Greenland (1992), VanderWeele (2009), Imai and
others (2010), and Tchetgen-Tchetgen (2011) assume the existence of
counterfactual outcomes under all possible combinations of the treatment and the
mediator, $Y_{a,m}$. In the words of Robins and Richardson (2010), there has to be
“reasonable agreement” as to what is the “closest possible world” in which the
mediator has a specific value, a value which is different from the one that was
observed. There are cases where reasonable agreement may exist. For example,
Pearl (2001) describes a setting where the mediator is a treatment, aspirin, in
which case the mediator could be set to specific values. However, in many
practical situations the mediator of interest is not a treatment, and there is no
known way in which one can set the mediator to a specific value. Then, the
quantities $Y_{1,M_0}$ and $Y_{a,m}$ are not clearly defined. Cole and Frangakis
(2009) provide a nice example: “there are many competing ways to assign
(hypothetically) a body mass index of 25 kg/m$^2$ to an individual, and each of
them may have a different causal effect on the outcome”.

The cross-worlds or mediator-randomization assumption that is generally used 
to identify natural direct and indirect effects states that 


$$Y_{a',m} \perp\!\!\!\perp M_a \mid C = c, A = a. \tag{1}$$


In words: for a unit with treatment A = a and pre-treatment covariates C, the
mediator under treatment a ($M_a$) should be independent of the outcome under
any other treatment-mediator combination ($Y_{a',m}$, the outcome under treatment
$a'$ had the mediator been set to m). Identification of natural direct and indirect
effects thus involves assumptions about cross-worlds quantities. Suppose for now
that the $Y_{a',m}$ are all well-defined: there is reasonable agreement about how
to set the mediator to a specific value. Then (1) is similar to the classical
assumption of no unmeasured confounding in causal inference (e.g., Robins and
others (1992)). To understand (1), notice that “nature” determines the values of
the mediators $M_a$ and the outcomes $Y_{a',m}$, based on C and possibly other
factors. Equation (1) thus states that given C, the $Y_{a',m}$ do not help to
predict $M_a$; or, nature did not have more information on the potential outcomes
$Y_{a',m}$ to determine the value of the mediator $M_a$ than recorded in C. In
other words, all common causes of mediator and outcome have to be recorded in C.
Then, under a consistency assumption,

$$E(Y_{1,M_0}) = \int_{(c,m)} E[Y \mid M = m, C = c, A = 1] \, f_{M \mid C=c, A=0}(m) \, f_C(c) \, dm \, dc, \tag{2}$$

the “mediation formula”, see e.g. Pearl (2001, 2011), VanderWeele (2009), and
Imai and others (2010).
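For a discrete mediator and discrete covariates, the integrals in the mediation formula (2) become sums, and each factor can be replaced by a sample proportion or a sample mean. The following is a minimal sketch of such a plug-in computation; the function name `mediation_formula`, the record layout `(a, c, m, y)`, and the toy data are assumptions made for illustration and are not taken from the article.

```python
from collections import defaultdict

def mediation_formula(records):
    """Plug-in version of formula (2) for discrete M and C.

    records: iterable of (a, c, m, y) tuples with a in {0, 1}.
    Returns sum_c sum_m E[Y | m, c, A=1] * P(M=m | c, A=0) * P(C=c).
    """
    records = list(records)
    n = len(records)
    n_c = defaultdict(int)       # counts of C = c among all units
    n_c0 = defaultdict(int)      # counts of C = c among A = 0
    n_mc0 = defaultdict(int)     # counts of (M = m, C = c) among A = 0
    y_sum = defaultdict(float)   # sum of Y for (m, c) among A = 1
    y_cnt = defaultdict(int)     # number of units for (m, c) among A = 1
    for a, c, m, y in records:
        n_c[c] += 1
        if a == 0:
            n_c0[c] += 1
            n_mc0[(m, c)] += 1
        else:
            y_sum[(m, c)] += y
            y_cnt[(m, c)] += 1
    total = 0.0
    for (m, c), cnt in n_mc0.items():
        if y_cnt[(m, c)] == 0:
            raise ValueError(f"no treated units with M={m}, C={c}")
        mean_y = y_sum[(m, c)] / y_cnt[(m, c)]
        total += mean_y * (cnt / n_c0[c]) * (n_c[c] / n)
    return total

# Toy data with a single covariate level c = 0.
toy = [(0, 0, 0, 0.0), (0, 0, 0, 0.0), (0, 0, 1, 0.0), (0, 0, 1, 0.0),
       (1, 0, 0, 1.0), (1, 0, 0, 3.0), (1, 0, 1, 10.0), (1, 0, 1, 10.0)]
estimate = mediation_formula(toy)
```

The sketch raises an error when some mediator-covariate combination seen among the untreated has no treated counterpart, since the conditional mean of Y is then not estimable without a model.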


Under certain conditions (strong parametric assumptions, linear models and
no exposure-mediator interaction), the estimators for the natural direct and
indirect effects resulting from (2) are the same as the estimators in Baron and
Kenny (1986), the founding article on direct and indirect effects. The causal
inference literature on natural direct and indirect effects thus generalizes the
approach of Baron and Kenny (1986) and adds a causal interpretation to their
estimators.
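The agreement under linear models without exposure-mediator interaction can be checked numerically. The sketch below is an illustration under assumed, simulated data: it fits a linear mediator model and a linear outcome model by least squares, evaluates formula (2) with those fits, and compares the result with the coefficient on treatment and the product of coefficients, which is the form in which the linear-model estimators are commonly presented. Variable names and coefficient values are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5000
C = rng.normal(size=n)
A = rng.integers(0, 2, size=n)
M = 1.0 + 0.7 * A + 0.5 * C + rng.normal(size=n)
Y = 2.0 + 1.2 * A + 0.9 * M + 0.4 * C + rng.normal(size=n)

ones = np.ones(n)
# Mediator model M ~ 1 + A + C; outcome model Y ~ 1 + A + M + C (no A*M term).
a_hat, *_ = np.linalg.lstsq(np.column_stack([ones, A, C]), M, rcond=None)
b_hat, *_ = np.linalg.lstsq(np.column_stack([ones, A, M, C]), Y, rcond=None)

def mean_outcome(a_outcome, a_mediator):
    # Formula (2) under the two linear models: plug the fitted mean of M
    # under a_mediator into the fitted outcome model under a_outcome,
    # then average over the empirical distribution of C.
    m_mean = a_hat[0] + a_hat[1] * a_mediator + a_hat[2] * C
    return np.mean(b_hat[0] + b_hat[1] * a_outcome
                   + b_hat[2] * m_mean + b_hat[3] * C)

direct = mean_outcome(1, 0) - mean_outcome(0, 0)     # equals b_hat[1]
indirect = mean_outcome(1, 1) - mean_outcome(1, 0)   # equals a_hat[1] * b_hat[2]
```

Here the mean outcome under treatment is also taken from the fitted models, which makes the two identities hold exactly rather than approximately.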


4 Definitions of organic intervention and organic direct and indirect effects

This section defines organic direct and indirect effects. Analogously to natural
direct and indirect effects, this article focuses on interventions I that cause the
mediator under treatment A = 1 and intervention I, $M_{1,I=1}$, to have the same
distribution as $M_0$, given the pre-treatment common causes C of mediator and
outcome. However, for individual units, $M_{1,I=1}$ does not need to be exactly the
value the mediator would have had without treatment, $M_0$. This is a considerable
relaxation, especially because this distribution can be estimated from the observed
data (provided C has been measured), while individual values of $M_0$ are not
observed under treatment. Hence, it is possible to imagine an intervention that
leads to this distribution. I term this type of interventions organic because they
depend on the entire distribution of $M_0$, the mediator without treatment, rather
than on individual values of $M_0$. Write $Y_{1,I=1}$ for the outcome under
treatment A = 1 and intervention I. Then,

Definition 4.1 (Organic intervention). An intervention I is an organic
intervention with respect to C if

$$M_{1,I=1} \mid C = c \;\sim\; M_0 \mid C = c \tag{3}$$

and

$$Y_{1,I=1} \mid M_{1,I=1} = m, C = c \;\sim\; Y_1 \mid M_1 = m, C = c \tag{4}$$

both hold, where $\sim$ indicates having the same distribution.


Equation (3) says that I “holds the mediator at its distribution under no
treatment”: given C, there is no difference in the distribution of the mediator
under treatment A = 1 combined with intervention I and the distribution of the
mediator under no treatment. Rather than the cross-worlds assumption of
equation (1), I assume equation (4). Equation (4) intuitively states that I “has
no direct effect on the outcome”: for units with pre-treatment common causes of
mediator and outcome fixed at C = c, the prognosis of units under treatment
“with mediator $M_{1,I=1}$ being equal to m under intervention I” is the same as
the prognosis of units under treatment “with mediator $M_1$ being equal to m
without I”. In other words, given C, treated units with mediator equal to m
($M_1 = m$) without intervention I are representative of treated units with
$M_{1,I=1} = m$ under intervention I. Equation (4) could be relaxed by assuming
instead that
$E[Y_{1,I=1} \mid M_{1,I=1} = m, C = c] = E[Y_1 \mid M_1 = m, C = c]$. If the
intervention I on the mediator has a direct effect on the outcome, equation (4)
fails to hold. Equation (4) is related to the assumption of “partial
exchangeability” in Robins and Greenland (1992) and can be discussed with subject
matter experts (Web-appendix E may also help).


Example 4.2 A = 1 could be a blood pressure lowering medicine, M blood
pressure, and Y the occurrence of a heart attack. To investigate whether A = 1
also has a direct effect on heart attacks, one could do mediation analysis.
Suppose that $M_0 = \alpha_0^{(0)} + \alpha_1 C + e_0$ and
$M_1 = \alpha_0^{(1)} + \alpha_1 C + e_1$, and suppose that $e_0 \sim e_1$, $e_0$
and $e_1$ are random error terms in $\mathbb{R}$ independent of C, and
$\alpha_0^{(0)}, \alpha_0^{(1)} \in \mathbb{R}$. Thus, treatment A = 1 shifts the
distribution of the blood pressure by $\alpha_0^{(1)} - \alpha_0^{(0)}$ without
changing its shape. Suppose an intervention I leads to
$M_{1,I=1} = \alpha_0^{(I)} + \alpha_1 C + e_{1,I=1}$. Then, I satisfies
equation (3) if 1. $\alpha_0^{(I)} = \alpha_0^{(0)}$ (that is, I = 1 shifts the
distribution of the blood pressure, in the treated, by
$\alpha_0^{(0)} - \alpha_0^{(1)}$ without changing its shape) and
2. $e_{1,I=1} \sim e_0$ is independent of C. Then, $M_{1,I=1} \sim M_0$ given C,
leading to (3). Intervention I could for example be salt in a (possibly random)
dosage depending on C. The effect of salt on heart attacks is believed to be
through its effect on blood pressure (see for example the CDC website,
http://www.cdc.gov/vitalsigns/Sodium/index.html), making equation (4) and thus
Definition 4.1 plausible for this intervention. For natural direct and indirect
effects, one would need to be able to shift the distribution of M by
$\alpha_0^{(0)} - \alpha_0^{(1)}$ without changing its shape, but additionally
set $e_{1,I=1} = e_0$, resulting in $M_{1,I=1} = M_0$. For the direct and indirect
effects introduced in Didelez and others (2006), one would randomize
$e_{1,I=1} \sim e_0$ independent of C, and then need to set the mediator to
$M_{1,I=1} = \alpha_0^{(0)} + \alpha_1 C + e_{1,I=1}$. Didelez and others (2006)
avoid the use of counterfactuals altogether using graphical models. Of the three
interventions above, obviously, $e_{1,I=1} = e_0$ places the strongest
restriction. It is related to the assumption of rank preservation sometimes made
in the causal inference literature. Rank preservation also implies that two units
with the same observed data have the same counterfactual data.
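The distinction in Example 4.2 between matching the distribution of $M_0$ and matching its value for every unit can be illustrated by simulation. The sketch below is hypothetical: the intercepts, slope, and error scale are made-up values, and the intervention is modeled as a pure location shift of the treated mediator.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100_000
# Illustrative values for alpha_0^(0), alpha_0^(1) and alpha_1.
alpha0_untreated, alpha0_treated, alpha1 = 130.0, 120.0, 5.0

C = rng.normal(size=n)
e0 = rng.normal(scale=8.0, size=n)   # error term of M_0
e1 = rng.normal(scale=8.0, size=n)   # error term of M_1, distributed as e0

M0 = alpha0_untreated + alpha1 * C + e0
M1 = alpha0_treated + alpha1 * C + e1

# Hypothetical intervention I: shift the treated mediator by
# alpha_0^(0) - alpha_0^(1). The error term stays e1, so the shifted
# mediator differs from M_0 unit by unit.
M1_I = M1 + (alpha0_untreated - alpha0_treated)

# Compare the conditional distributions given C through the residuals
# that remain after removing the common dependence on C.
res0 = M0 - alpha1 * C
res1_I = M1_I - alpha1 * C
mean_gap = abs(res0.mean() - res1_I.mean())
sd_gap = abs(res0.std() - res1_I.std())
unit_level_equal = np.allclose(M0, M1_I)
```

In this simulation the two residual distributions agree up to sampling error, as equation (3) requires, while `unit_level_equal` is false: the natural-effects requirement $M_{1,I=1} = M_0$ is not met.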

In this example, one could replace the fully parametric models by
$M_0 = g(C, e_0)$ and $M_1 = g(C, e_1) + \beta$, with g some function of C and
elements in $\mathbb{R}$. Then, I needs to shift the distribution of $M_1$ given C
by $-\beta$. Or, one could have $M_0 = g(C, e_0)$ and
$M_1 = g(C, e_1) + \beta_0 + \beta_1 C$, where now I needs to shift the
distribution of $M_1$ given C by $-\beta_0 - \beta_1 C$.

If a pre-treatment common cause C of mediator and outcome has not been observed,
equation (4) without C is unlikely to hold. The reason is that the predictive
value of the mediator having a specific value under intervention I is not
the same as the predictive value of the mediator having a specific value without
intervention. The mediator $M_1$ under treatment is predicted by the common cause
C. However, if C is not included, under intervention I the mediator $M_{1,I=1}$ is
not necessarily predicted by C. So, the mediator $M_1$ carries information on the
common cause C, but the mediator under intervention I, $M_{1,I=1}$, may not. Even
if $M_{1,I=1}$ carries information on C, the information on C from
$M_{1,I=1} = m$ may be different from the information on C from $M_1 = m$,
because $M_{1,I=1}$ and $M_1$ have a different distribution. As a consequence, the
prognosis under treatment of units with $M_1 = m$ is different from the prognosis
under treatment of units with $M_{1,I=1} = m$, violating equation (4).
Web-appendix B has a detailed example of the consequences of ignoring a
pre-treatment common cause C of mediator and outcome.

When there is a post-treatment common cause $\tilde{C}$ of the mediator and the
outcome, equation (4) is also unlikely to hold. Assuming that the intervention I
does not affect $\tilde{C}$, the reason is the same as for unobserved
pre-treatment common causes. If the intervention I also changes $\tilde{C}$,
basing mediation analysis on I results in estimating the effect mediated by
($\tilde{C}$, M).

If an intervention I satisfying equation (3) is feasible, which can be tested,
equation (4) or its relaxation could be tested as well, by comparing the
distribution of $Y_{1,I=1}$ given ($M_{1,I=1}$, C) to the distribution of $Y_1$
given ($M_1$, C). In order to test this, an experiment must be carried out with
three arms: “do not treat”, “treat”, and “treat under intervention I”. This is in
contrast with the existing literature on natural direct and indirect effects, the
assumptions of which can never be tested because they involve the joint
distribution of counterfactuals under different treatments, which can never be
jointly observed.
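As a rough illustration of how the relaxed version of equation (4) might be examined in such a three-arm experiment, the sketch below fits the same working regression of Y on (M, C) in the “treat” arm and in the “treat under intervention I” arm and compares the fitted coefficients. Everything here is an assumption made for illustration: the data are simulated so that the condition holds by construction, the regression is linear, and a real analysis would replace the informal comparison by a formal test with standard errors.

```python
import numpy as np

def fit_outcome_regression(M, C, Y):
    # Least-squares fit of Y on (1, M, C); returns the three coefficients.
    X = np.column_stack([np.ones_like(M), M, C])
    coef, *_ = np.linalg.lstsq(X, Y, rcond=None)
    return coef

rng = np.random.default_rng(2)
n = 50_000

def simulate_arm(shift):
    # Treated units; `shift` is the hypothetical effect of I on the mediator.
    C = rng.normal(size=n)
    M = 2.0 + C + shift + rng.normal(size=n)
    # Y depends on the arm only through M, so the relaxed (4) holds here.
    Y = 1.0 + 0.6 * M + 0.3 * C + rng.normal(size=n)
    return M, C, Y

coef_treat = fit_outcome_regression(*simulate_arm(shift=0.0))
coef_treat_I = fit_outcome_regression(*simulate_arm(shift=-1.5))
max_gap = np.max(np.abs(coef_treat - coef_treat_I))
```

A large gap between the two coefficient vectors would suggest that the intervention affects the outcome other than through the mediator; a small gap is compatible with, but does not prove, the full distributional statement in equation (4).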

Now the organic direct and indirect effect of a treatment on the outcome can
be defined:


Definition 4.3 (Organic direct and indirect effect). Consider an organic
intervention I. The organic direct effect of a treatment A based on I is
$E(Y_{1,I=1}) - E(Y_0)$. The organic indirect effect of a treatment A based on I
is $E(Y_1) - E(Y_{1,I=1})$.
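Definition 4.3 turns three mean outcomes into two effects. The helper below is a small sketch of that bookkeeping; the function name and the numbers passed to it are illustrative assumptions, not results from any study.

```python
def organic_effects(mean_y1, mean_y0, mean_y1_intervention):
    """Return (direct, indirect, total) following Definition 4.3.

    mean_y1:              estimate of E(Y_1)
    mean_y0:              estimate of E(Y_0)
    mean_y1_intervention: estimate of E(Y_{1,I=1})
    """
    direct = mean_y1_intervention - mean_y0
    indirect = mean_y1 - mean_y1_intervention
    return direct, indirect, direct + indirect

# Illustrative numbers only.
direct, indirect, total = organic_effects(0.08, 0.25, 0.20)
```

The third returned value equals the total effect $E(Y_1) - E(Y_0)$, because the term $E(Y_{1,I=1})$ cancels when the two effects are added.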

Because the treatment is the same for both $Y_1$ and $Y_{1,I=1}$,
$E(Y_1) - E(Y_{1,I=1})$ is the organic indirect effect, or mediated part of the
effect. It is the effect of the organic intervention on the mediator, I, under
A = 1. If the distribution of the mediator does not depend on A, I could be “no
intervention on M”, and the organic indirect effect is 0. The organic indirect
effect is also 0 if the intervention on the mediator does not affect the outcome.
Because the mediator has the same distribution for both $Y_{1,I=1}$ and $Y_0$,
$E(Y_{1,I=1}) - E(Y_0)$ is the organic direct effect. The direct effect is the
effect of treatment combined with an organic intervention as compared to no
treatment. Very loosely, the direct effect is the effect of a treatment that
1. has the same direct effect as treatment A = 1: the dependence of $Y_{1,I=1}$ on
the covariates C and on the mediator is the same as that of $Y_1$, but 2. has
no indirect effect through the mediator (see (3)). Notice that
$E(Y_1) - E(Y_0) = (E(Y_1) - E(Y_{1,I=1})) + (E(Y_{1,I=1}) - E(Y_0))$. Thus, like
for natural direct and indirect effects, organic direct and indirect effects add
up to the total effect of a treatment. Organic direct and indirect effects often
generalize natural direct and indirect effects:


Theorem 4.4 Under equation (1), natural direct and indirect effects and the
direct and indirect effects defined in Didelez and others (2006) are special
cases of organic direct and indirect effects.


5 Uniqueness of organic direct and indirect effects 

Definition 4.3 of organic direct and indirect effects depends on the organic
intervention I and on the choice of baseline common causes of mediator and outcome
C. Although the definitions of natural direct and indirect effects also depend on
the intervention (the mediator is set to a specific value), this has not usually
been made explicit. I argued that C has to include all common causes of mediator
and outcome for equation (4) to be plausible, and thus for an intervention I to be
organic. This section formalizes the notion of common causes of mediator and
outcome, and argues that the organic direct and indirect effects do not depend on
(a) for given C, the choice of organic intervention I or (b) the choice of common
causes C of mediator and outcome, even if more than one set of common causes
exists.

Define a common cause of mediator and outcome given C as follows:

Definition 5.1 (Common cause). X is not a common cause of mediator and outcome
given C if either equation (5) or equation (6) holds:

$$X \perp\!\!\!\perp M_0 \mid C \quad \text{and} \quad X \perp\!\!\!\perp M_1 \mid C \tag{5}$$

$$X \perp\!\!\!\perp Y_1 \mid M_1, C. \tag{6}$$

That is, X is not a common cause if, given C, either X does not predict the
mediator, or, given the mediator, X does not predict the outcome. In graphical
language: X is not a common cause of outcome and mediator if in a DAG that has
C, X, M, and Y, there either is no arrow from X to M, or there is no direct arrow
from X to Y. This definition is in line with, for example, Pearl (2000). If (all
given C) X predicts the mediator and, given the mediator, X predicts the outcome,
it is a common cause of mediator and outcome, and usually needs to be included in
C for equation (4) to hold with C (see the discussion below Definition 4.1). The
following theorem is proved in Web-appendix A:


Theorem 5.2 For given C, the organic direct and indirect effect do not depend on
the choice of organic intervention I with respect to C. Furthermore, if C and
$\tilde{C}$ are different sets of common causes of mediator and outcome,
$\tilde{C}$ is not a common cause of mediator and outcome given C, and C is not a
common cause of mediator and outcome given $\tilde{C}$, then the organic direct
and indirect effect do not depend on whether the intervention is organic with
respect to C or organic with respect to $\tilde{C}$.


Thus, if we restrict ourselves to interventions that are organic with respect to
“complete” common causes C (given C, any other pre-treatment covariate X is
not a common cause), organic direct and indirect effects are unique, and one can
speak of “the” organic direct and indirect effect.


6 Identifiability and estimation of organic direct and 
indirect effects 

When the treatment is randomized, $E(Y_1)$ and $E(Y_0)$ can simply be estimated
by the averages of $Y_1$ and $Y_0$ among units receiving treatment and not
receiving treatment, respectively. Therefore, in order to estimate the organic
direct and indirect effects of a randomized treatment, this section focuses on
estimating the expectation of $Y_{1,I=1}$. The following theorem is the main
result of this article:

Theorem 6.1 (Organic direct and indirect effects: the mediation formula for
randomized experiments). Under randomized treatment and Definition 4.1 of organic
interventions, the following holds for an intervention I that is organic with
respect to C:

$$E(Y_{1,I=1}) = \int_{(c,m)} E[Y \mid M = m, C = c, A = 1] \, f_{M \mid C=c, A=0}(m) \, f_C(c) \, dm \, dc.$$

Notice that to estimate $E(Y_{1,I=1})$, only the distribution of M under A = 0 and
of Y under A = 1 are needed. Thus, Theorem 6.1 can be used both in the absence
and in the presence of treatment-mediator interaction (where the expectation of Y
depends on M differently with or without treatment). Theorem 6.1 provides the
same mediation formula as the previous literature (see Section 3). This formula
depends on observable quantities only, and can be estimated using standard
models. The contribution of the current article is to show that the definition
and thus the interpretation of direct and indirect effects, as well as the
conditions under which estimators for these effects are meaningful, can be
considerably relaxed.
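One simple way to turn the formula of Theorem 6.1 into an estimator under randomization is sketched below. It is an illustration under stated assumptions, not the article’s own procedure: the outcome model among the treated is taken to be linear, the data are simulated with made-up coefficients, and the function name is hypothetical. The sketch uses the fact that, when treatment is randomized, the pairs (M, C) of untreated units have joint density $f_{M \mid C=c, A=0}(m) f_C(c)$, so averaging fitted treated-arm outcomes over those pairs approximates the integral.

```python
import numpy as np

def estimate_mean_y1_intervention(A, M, C, Y):
    # Step 1: working linear model for E[Y | M, C, A = 1], fitted in the treated.
    treated = A == 1
    X1 = np.column_stack([np.ones(treated.sum()), M[treated], C[treated]])
    coef, *_ = np.linalg.lstsq(X1, Y[treated], rcond=None)
    # Step 2: average the fitted values over the (M, C) pairs of untreated units.
    untreated = ~treated
    X0 = np.column_stack([np.ones(untreated.sum()), M[untreated], C[untreated]])
    return float(np.mean(X0 @ coef))

rng = np.random.default_rng(3)
n = 200_000
C = rng.normal(size=n)
A = rng.integers(0, 2, size=n)
M = 1.0 + 2.0 * A + C + rng.normal(size=n)           # treatment shifts M by 2
Y = 0.5 + 1.5 * A + 0.8 * M + 0.3 * C + rng.normal(size=n)

ey1 = Y[A == 1].mean()
ey0 = Y[A == 0].mean()
ey1_I = estimate_mean_y1_intervention(A, M, C, Y)
direct, indirect = ey1_I - ey0, ey1 - ey1_I          # simulated truth: 1.5 and 1.6
```

No outcome model is needed for the untreated, and no model for the mediator is needed at all in this version; only the treated-arm regression has to be specified.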


7 Estimating organic direct and indirect effects in 
observational studies 


So far, treatment was randomized. This section extends the identification to
non-randomized treatments A. As before, A is treatment, which is 1 for the treated
and 0 for the untreated. I adopt the usual consistency assumption (see e.g. Robins
and others (1992)), relating the observed to the counterfactual data:


Assumption 7.1 (Consistency). If A = 1, $M = M_1$ and $Y = Y_1$. If A = 0,
$M = M_0$ and $Y = Y_0$.


For observational data, I allow that there exist baseline covariates Z (beyond the
common causes of mediator and outcome, C) that need to be included in the
analysis in order to eliminate confounding:

Assumption 7.2 (No Unmeasured Confounding).

$$A \perp\!\!\!\perp (Y_1, M_1) \mid C, Z \quad \text{and} \quad A \perp\!\!\!\perp Y_0 \mid C, Z \quad \text{and} \quad A \perp\!\!\!\perp M_0 \mid C, Z.$$


Thus, given the measured pre-treatment covariates C and Z, treatment should not
depend on the prognosis of the units with or without treatment. For
Assumption 7.2 to hold, it is sufficient that (C, Z) includes all the common
causes of the treatment, the mediator, and the outcome. This is a particular
representation of the usual assumption of no unmeasured confounding in causal
inference (see e.g. Robins and others (1992)). Assumption 7.2 cannot be tested
statistically. Subject matter experts have to indicate whether they believe
enough pre-treatment unit characteristics have been observed in order for
Assumption 7.2 to be plausible.

Under Assumption 7.2, the expectation of $Y_1$ and $Y_0$ can be estimated using
marginal structural models, the G-computation formula, or structural nested
models. Thus, I focus on the expectation of $Y_{1,I=1}$. Section 4 argued that in
order for an intervention to be organic with respect to C, C usually has to
include all common causes of outcome and mediator. Therefore, if an extra Z was
necessary for Assumption 7.2 of no unmeasured confounding to hold, I will assume
that given C, Z is not a common cause of mediator and outcome, as defined in
Definition 5.1. Then,


Theorem 7.3 (Organic direct and indirect effects: the mediation formula for
observational studies). Assume No Unmeasured Confounding Assumption 7.2,
Consistency Assumption 7.1, intervention I is organic with respect to C as in
Definition 4.1, and given C, Z is not a common cause of mediator and outcome as in
Definition 5.1. Then

$$E(Y_{1,I=1}) = \int_{(c,z,m)} E[Y \mid M = m, C = c, Z = z, A = 1] \, f_{M \mid C=c, Z=z, A=0}(m) \, f_{C,Z}(c, z) \, dm \, d(c, z).$$

The proof is in Web-appendix A. The resulting organic direct and indirect effects
are, as in Theorem 6.1, expressed in terms of observable quantities only, and can
be estimated using standard methods.
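The formula of Theorem 7.3 can be evaluated with two working regressions. The sketch below is a hypothetical illustration on simulated observational data: treatment depends on C and Z, Z affects the mediator but not the outcome given (M, C), and both working models are linear. With an outcome model that is linear in M, integrating over the mediator density reduces to plugging in the fitted mean of M among the untreated. All names and coefficients are assumptions.

```python
import numpy as np

def ols(X, y):
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

rng = np.random.default_rng(4)
n = 200_000
C = rng.normal(size=n)
Z = rng.normal(size=n)
# Treatment depends on C and Z, so both must be adjusted for.
A = rng.binomial(1, 1.0 / (1.0 + np.exp(-(0.5 * C + 0.5 * Z))))
# Z affects M but, given (M, C), not Y: Z is not a common cause given C.
M = 1.0 + 2.0 * A + C + Z + rng.normal(size=n)
Y = 0.5 + 1.5 * A + 0.8 * M + 0.3 * C + rng.normal(size=n)

ones = np.ones(n)
design_m = np.column_stack([ones, C, Z])
design_y = np.column_stack([ones, M, C, Z])
t, u = A == 1, A == 0

gamma = ols(design_m[u], M[u])   # working model for E[M | C, Z, A = 0]
delta = ols(design_y[t], Y[t])   # working model for E[Y | M, C, Z, A = 1]

# Plug the fitted mean of M under A = 0 into the treated-arm outcome model,
# then average over the (C, Z) of all units, i.e. over f_{C,Z}.
m0_mean = design_m @ gamma
ey1_I = float(np.mean(delta[0] + delta[1] * m0_mean
                      + delta[2] * C + delta[3] * Z))
```

In this simulation the target value is 2.8, and a naive version that drops Z from both regressions would generally not recover it, because treatment assignment depends on Z.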


8 Application: Mother-to-child transmission of HIV/AIDS 


HIV-infection can be transmitted from an HIV-positive mother to her infant in
utero, during birth, and by breast feeding. The rate of HIV-transmission can be
lowered by avoiding breast feeding, as well as by treatments such as antiretroviral
treatment (ART) and zidovudine (AZT). ART and AZT lower the amount of HIV-virus,
the HIV viral load, in the blood of the mother. Sperling and others (1996)
describe that the effect of AZT on mother-to-child transmission of HIV-infection
is surprisingly large, given the limited effect of AZT on the HIV viral load in
the blood of the mother. They estimated that less than 20% of the effect of AZT
on mother-to-child transmission is due to the effect of AZT on the mother’s HIV
viral load, but their analysis was not based on current notions of direct and indirect
effects.

This section describes how one could investigate how much of the effect of 
AZT on mother-to-child transmission is mediated by the effect of AZT on the 
HIV viral load in the blood of the mother (from now on, the HIV viral load). I 
argue that the organic direct and indirect effects defined in this article are
well-defined and identified in this situation, whereas natural direct and indirect
effects are undefined.

Suppose one would like to investigate the likely effect on mother-to-child 
transmission of a potential new treatment that has the same effect on HIV vi¬ 
ral load as AZT but no direct effect on the child’s HIV status. Potentially, a 
low dosage of some type of ART could be such treatment. Let / be an inter¬ 
vention that, without AZT treatment, causes the distribution of HIV viral load 
to be the same as under AZT treatment; / represents the potential new treat¬ 
ment. Here, in contrast to most of the literature on mediation analysis, which 
focuses on the effect of an intervention on the mediator under treatment, interest 
focuses on the effect of an intervention on the mediator under no treatment. In 
order to directly apply the method described in this article, we therefore re-code
A = 0 if a person was treated with AZT, and A = 1 if a person was not treated
with AZT. In the case of a linear model without treatment-mediator interaction,
$E[Y \mid M = m, A = a, C = c] = \beta_0 + \beta_1 m + \beta_2 a + \beta_3^\top c$ (no interaction term in $am$), both
approaches lead to the same direct effect, $\beta_2$, and therefore also to the same indirect
effect. In general, both approaches can lead to different results. VanderWeele
(2009) and Web-appendix C discuss when each definition is most useful; this depends
on the context of the investigation.
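
As a sketch of why the direct effect reduces to the coefficient of treatment in the linear case, one can plug a linear model without treatment-mediator interaction, $E[Y \mid M = m, A = a, C = c] = \beta_0 + \beta_1 m + \beta_2 a + \beta_3^\top c$, into the identification formula for randomized treatment (Theorem 6.1). The derivation below assumes randomization, so that the distribution of C does not depend on A, and takes A = 1 to be the arm in which the intervention I acts.

```latex
\begin{align*}
E(Y_{1,I=1})
  &= \int_{(c,m)} E[Y \mid M = m, C = c, A = 1]\,
     f_{M \mid C = c, A = 0}(m)\, f_C(c)\, dm\, dc \\
  &= \beta_0 + \beta_2 + \beta_1\, E[M \mid A = 0] + \beta_3^\top E[C], \\
E(Y_0) = E[Y \mid A = 0]
  &= \beta_0 + \beta_1\, E[M \mid A = 0] + \beta_3^\top E[C], \\
E(Y_{1,I=1}) - E(Y_0) &= \beta_2 .
\end{align*}
```

The mediator distribution enters both expectations through the same term, $\beta_1 E[M \mid A = 0]$, which is why it cancels in the difference; with an interaction term in $am$ this cancellation would no longer be complete.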

In this example, one would expect that if AZT has a direct effect on mother-to-child
transmission, a mother’s adherence to AZT treatment is a post-treatment
common cause of both HIV viral load M and mother-to-child transmission Y, because
both M and Y will be reduced under better adherence. Thus, one seems to
need the post-treatment covariate “adherence”, ad, in C. However, equation (4)
seems reasonable without adherence: if all pre-treatment common causes are
in C, so adherence is not a proxy for other confounders, $ad \perp\!\!\!\perp (Y_1, M_1) \mid C$
(recall 1 indicates no treatment in this section). If adherence is not an issue for I,
$ad \perp\!\!\!\perp (Y_{1,I=1}, M_{1,I=1}) \mid C$. And if it is: because I does not have a direct effect on
mother-to-child transmission, $Y_{1,I=1} \perp\!\!\!\perp ad \mid M_{1,I=1}, C$. Thus, if equation (4) holds
with ad in the conditioning event and all pre-treatment common causes are in C,
equation (4) will also hold without ad.

For ease of exposition, suppose that AZT treatment is randomized (the approach
can be generalized to observational studies as in Section 7). I now illustrate
how to use the identification result of Section 6 to estimate the indirect effect
of AZT on mother-to-child transmission. Suppose that $M_1 \sim M_0 + \beta_1 + \beta_2^\top C \mid C$
holds for M equal to log HIV viral load. Suppose in addition that the probability
of mother-to-child transmission without treatment follows a logistic regression
model of the form $\mathrm{logit}\, P(Y = 1 \mid M = m, C = c, A = 1) = \theta_0 + \theta_1 m + \theta_2^\top c$.
Notice that one only needs such a model for mother-to-child transmission under
A = 1 (no treatment in this case). Then, by Web-appendix D, it follows that

$E(Y_{1,I=1}) = E\left[ 1 / \left(1 + \exp\left(-\theta_0 - \theta_1 (M - \beta_1 - \beta_2^\top C) - \theta_2^\top C\right)\right) \mid A = 1 \right].$ (7)

This expression can be estimated as indicated in Web-appendix D. This leads to an 
estimator for the indirect effect that does not use data on the outcomes for treated 
mothers. 
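
The plug-in estimator based on equation (7) can be sketched in a few lines of Python. This is an illustration under assumptions that are not in the article: C is a single scalar covariate, the mediator shift is estimated from a linear regression of M on A, C and A·C, the logistic model is fitted by a hand-written Newton-Raphson routine, and the data come from a hypothetical simulation. The function names (`fit_logistic`, `mean_outcome_under_intervention`, `simulate`) are invented for this sketch.

```python
import numpy as np

def expit(x):
    """Inverse logit."""
    return 1.0 / (1.0 + np.exp(-x))

def fit_logistic(X, y, n_iter=25):
    """Logistic regression by Newton-Raphson; X already contains an intercept column."""
    theta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        p = expit(X @ theta)
        gradient = X.T @ (y - p)
        hessian = (X * (p * (1.0 - p))[:, None]).T @ X
        theta = theta + np.linalg.solve(hessian, gradient)
    return theta

def mean_outcome_under_intervention(A, M, C, Y):
    """Plug-in version of equation (7) for a scalar covariate C."""
    # Mediator model: the coefficients of A and A*C estimate the shift beta_1 + beta_2 * C.
    design = np.column_stack([np.ones_like(M), A, C, A * C])
    coef = np.linalg.lstsq(design, M, rcond=None)[0]
    beta1, beta2 = coef[1], coef[3]
    # Outcome model: logistic regression fitted in the A = 1 arm only.
    arm1 = A == 1
    X1 = np.column_stack([np.ones(arm1.sum()), M[arm1], C[arm1]])
    theta = fit_logistic(X1, Y[arm1])
    # Shift the mediator observed under A = 1 to the A = 0 distribution, then average.
    shifted = M[arm1] - beta1 - beta2 * C[arm1]
    return float(np.mean(expit(theta[0] + theta[1] * shifted + theta[2] * C[arm1])))

def simulate(n, seed=0):
    """Hypothetical data-generating process, used only to exercise the estimator."""
    rng = np.random.default_rng(seed)
    C = rng.normal(size=n)
    A = rng.integers(0, 2, size=n)  # randomized arm indicator
    M = 1.0 + 0.8 * A + 0.3 * C + rng.normal(size=n)
    Y = rng.binomial(1, expit(-1.5 + 0.7 * M + 0.4 * C - 0.5 * (1 - A)))
    return A, M, C, Y

A, M, C, Y = simulate(20000)
print(mean_outcome_under_intervention(A, M, C, Y))
```

Only outcomes from the A = 1 arm enter the logistic fit; the A = 0 arm contributes through the mediator regression alone, which mirrors the remark that the estimator does not use outcome data from the other arm.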

In contrast to the organic direct and indirect effects, the natural direct and indirect
effects are undefined in this application. They involve $Y_{1,M_0}$: whether or
not a newborn is infected without AZT but with the HIV viral load of the mother
set to the value it would have had under AZT (A = 0 here). How one could set
the mediator to the value under AZT is unclear. One can imagine treatments, for
example low-dose ART, that have the same effect on HIV viral load as AZT, as
needed for organic direct and indirect effects. However, it is unlikely that such a
treatment would, for all mothers, set the HIV viral load to the exact same value
it would have had under AZT. If AZT were a combination of substances, some
combination of a substance that affects HIV viral load and another substance that
might directly affect mother-to-child transmission, one could imagine setting the
HIV viral load by administering only the substance that affects HIV viral load. However,
like many treatments, AZT is just one substance. I therefore conclude that
for a treatment like AZT, the organic direct and indirect effects are more natural
than their natural counterparts.

9 Discussion 

This article shows that, in contrast to the assumptions behind natural direct and
indirect effects, cross-worlds quantities and setting the mediator are not necessary
to define causal direct and indirect effects. This leads to newly defined organic
direct and indirect effects. Furthermore, this article proves that, in contrast to natural
direct and indirect effects, identification of organic direct and indirect effects
does not rely on the existence of counterfactual outcomes under all combinations
of the treatment and the mediator. For identifiability of organic direct and indirect
effects, a distributional assumption linking the distribution of the outcome under
an organic intervention to the data replaces the cross-worlds assumption which
identifies natural direct and indirect effects. This article focuses on organic interventions
I, which cause the distribution of the mediator given C to be the same
as that of $M_0$, rather than setting the mediator value to $M_0$, as in natural direct and
indirect effects. In applications in the health or social sciences, like epidemiology or
psychology, one often wants to consider which part of the effect of a treatment is
mediated through some covariate or trait. For example, one may want to investigate
how much of the effect of antiretroviral treatment, ART, on AIDS-defining
events and death is mediated by the CD4 count. In this example, it is easier to envision
an intervention that causes the CD4 count to have a particular distribution
rather than setting the CD4 count to a specific value for each patient. If interventions
on the mediator are inconceivable, both natural and organic direct and
indirect effects are undefined.

I have shown that the proposed organic direct and indirect effects are identified
by the same expressions as developed previously in the literature for natural
direct and indirect effects. The contribution of this article is to show that these
mediation formulas hold in substantially more generality. As a consequence, estimators
based on the mediation formulas have a much broader causal interpretation
than previously shown. The new definitions introduced in this article are easy to
interpret and can therefore be easily discussed with subject matter experts. For
an intervention I to be organic it has to be that, given pre-treatment characteristics
C, the outcome under treatment “for a unit with $M_1 = m$ under treatment”
is representative of the outcome under treatment “if the organic intervention I
caused $M_{1,I=1} = m$”. This can be interpreted as that, under treatment, the organic
intervention has no direct effect on the outcome.

An organic intervention I is a considerable relaxation of $M_{1,I=1} = M_0$. Still, it
may be difficult to find an organic intervention. Notice, however, that if there is an
intervention J such that equation (4) holds, then in some cases it may be possible
to construct an organic intervention I by adapting the dosage of J as a function of
C (deterministically or randomly) in a way such that equation (3) holds. If there
is interest in figuring out what might be the benefit of an intervention with only a
direct or only an indirect effect, if such an intervention would be developed in a
lab, organic direct and indirect effects are of interest. In any case, being able to
actually carry out organic interventions is not necessary to identify and estimate
organic direct and indirect effects. Rather, organic interventions can be employed
as thought experiments useful to frame the analysis and define the parameters of
interest.

Natural direct and indirect effects are defined at the individual level as well as
the population level, whereas organic direct and indirect effects are defined only
at the population level. This reflects that an organic intervention does not set the
mediator to a pre-specified value for each unit.

In related work, the appendix of VanderWeele (2012) considers interventions
that cause the mediator to have the same distribution as without treatment, conditional
on C. However, for identification he still assumes existence of all $Y_{a,m}$ and
$Y_{a,m} \perp\!\!\!\perp M \mid A, C, L$, where L is a post-treatment common cause of mediator and
outcome. This is problematic if one cannot set the mediator to particular values.

Following the previous literature, this article studies interventions that do not
affect pre-treatment common causes of mediators and outcomes. For example,
inherited risk factors are thought to be common causes of low birth weight and infant
mortality. For equation (4) to be plausible, such common causes of mediators
and outcomes have to be taken into account. The consequences of ignoring common
causes are illustrated in Web-appendix B for the direct effect of smoking on



call pure direct effect, “is non-manipulable relative to A, M and Y in the sense
that, in the absence of assumptions, the pure direct effect does not correspond to a
contrast between treatment regimes of any randomized experiment performed via
interventions on A, M and Y.” Organic direct and indirect effects are not subject
to that caveat. If there exists an organic intervention I (not necessarily $M_{1,I=1} = M_0$),
then the organic direct and indirect effect induced by I are identified from
the experiments “do not treat”, “treat”, and “treat under intervention I.” Both
conditions for I to be organic can be tested on the basis of these experiments,
and the organic direct and indirect effects do not depend on the choice of organic
intervention I.

Under an agnostic model, which does not assume the existence of counterfactual
outcomes, the natural direct and indirect effects are obviously not defined:
they are based on the cross-worlds counterfactuals $Y_{1,M_0}$. In contrast, if an organic
intervention I exists, the organic direct and indirect effects could have been equivalently
defined without counterfactual outcomes, because they can be defined on
the basis of interventions; see Web-appendix F for details.

In future work, I will show that in contrast to natural direct and indirect effects,
organic direct and indirect effects can be extended to provide an identification
result for the case where there are post-treatment mediator-outcome confounders.
This will provide another alternative to the three quantities described in
VanderWeele and others (2014).

The methodology in this article could also be applied to the study of the effects
of future treatments based on prior data. Consider, for example, the effect of
a future treatment to lower immune activation (M in the notation of this article)
in HIV-positive patients. Suppose that this future treatment is aimed to eventually
prevent clinical events (Y). Assume that under the future treatment, patients
would have a specific distribution of immune activation $M_1$, and the future treatment
has no direct effect on the outcome Y: conditional on a set of covariates C,
the prognosis of patients under the future treatment, $Y_1$, is the same as the prognosis
Y of patients with the same immune activation in the observed data (compare
with (4)). Then the mean outcome under the future treatment can be estimated
using a sample counterpart of

$$E(Y_1) = \int_{(c,m)} E[Y \mid M = m, C = c]\, f_{M_1 \mid C = c}(m)\, f_C(c)\, dm\, dc.$$


The proof follows the same lines as Theorem 6.1, but the interpretation is different
because the future treatment does not necessarily cause immune activation to have
the same distribution as some existing treatment A. Of course, since the identifying
assumption of equation (4) cannot be verified without experimental data of
the future treatment, an experiment with the future treatment would be necessary
to confirm this result. This last formula also provides a mathematical underpinning
of the application in Naimi and others (2014), who estimate the controlled
direct effect of an intervention when “only a portion of the population’s mediator
is altered”.

To conclude, this article introduces organic direct and indirect effects and provides
identification and estimators for these effects. The assumptions are weaker
than for natural direct and indirect effects. 


10 Funding 

This work was supported by the National Institutes of Health [grant number R01
AI100762]. The content is solely the responsibility of the author and does not
necessarily represent the official views of the National Institutes of Health.


Acknowledgements 

The author thanks Eric Tchetgen Tchetgen and Tyler VanderWeele for sharing 
their drafts and comments, Victor DeGruttola for comments and for suggesting 
the mother-to-child HIV transmission example, and Alberto Abadie for extensive 
comments on this article. 


References 

Baron, R M and Kenny, D A. (1986). The moderator-mediator variable distinction
in social psychological research: conceptual, strategic, and statistical
considerations. Journal of Personality and Social Psychology 51, 1173-1182.

Cole, S R and Frangakis, C E. (2009). The consistency statement in causal
inference: a definition or an assumption? Epidemiology 20(1), 3-5.

Dawid, A P. (1979). Conditional independence in statistical theory (with discussion).
Journal of the Royal Statistical Society B 41, 1-31.

Didelez, V, Dawid, A P and Geneletti, S. (2006). Direct and Indirect
Effects of Sequential Treatments. In: Proceedings of the 22nd Annual Conference
on Uncertainty in Artificial Intelligence. Arlington, VA: AUAI Press, pp.
138-146.

Geneletti, S. (2007). Identifying direct and indirect effects in a non-counterfactual
framework. Journal of the Royal Statistical Society. Series B
(Statistical Methodology) 69(2), 199-215.

Hernandez-Diaz, S, Schisterman, E F and Hernan, M A. (2006). The
birth weight “paradox” uncovered? American Journal of Epidemiology 164,
1115-1120.

Imai, K, Keele, L and Yamamoto, T. (2010). Identification, inference and
sensitivity analysis for causal mediation effects. Statistical Science 25, 51-71.

Naimi, A I, Moodie, E E M, Auger, N and Kaufman, J S. (2014). Stochastic
mediation contrasts in epidemiologic research: interpregnancy interval and
the educational disparity in preterm delivery. American Journal of Epidemiology
180(4), 436-445.

Pearl, J. (2000). Causality. Models, reasoning, and inference. Cambridge: Cambridge
University Press.

Pearl, J. (2001). Direct and indirect effects. In: Proceedings of the 17th annual
conference on uncertainty in artificial intelligence (UAI-01). San Francisco:
Morgan Kaufmann, pp. 411-442.

Pearl, J. (2011). The Mediation Formula: A guide to
the assessment of causal pathways in nonlinear models.
Technical Report, University of California, Los Angeles.
<http://ftp.cs.ucla.edu/pub/stat_ser/r379.pdf>.

Robins, J M, Blevins, D, Ritter, G and Wulfsohn, M. (1992). G-estimation
of the effect of prophylaxis therapy for pneumocystis carinii pneumonia
on the survival of AIDS patients. Epidemiology 3(4), 319-336.

Robins, J M and Greenland, S. (1992). Identifiability and exchangeability
for direct and indirect effects. Epidemiology 3, 143-155.

Robins, J M and Richardson, T. (2010). Alternative Graphical Causal Models
and the Identification of Direct Effects. Technical Report, Center for Statistics
and the Social Sciences, University of Washington.

Sperling, R S, Shapiro, D E, Coombs, R W, Todd, J A, Herman,
S A, McSherry, G D, O’Sullivan, M J, Van Dyke, R B, Jimenez, E,
Rouzioux, C, Flynn, P M and others. (1996). Maternal viral load, zidovudine
treatment, and the risk of transmission of Human Immunodeficiency Virus
type 1 from mother to infant. New England Journal of Medicine 335(22),
1621-1629.

Tchetgen-Tchetgen, E J. (2011). On causal mediation analysis with a survival
outcome. The International Journal of Biostatistics 7(1), 1-38.

Tchetgen-Tchetgen, E J and Shpitser, I. (2012). Semiparametric Theory
for Causal Mediation Analysis: efficiency bounds, multiple robustness and
sensitivity analysis. The Annals of Statistics 40(3), 1816-1845.

Valeri, L and VanderWeele, T J. (2013). Mediation analysis allowing for
exposure-mediator interaction and causal interpretation: theoretical assumptions
and implementation with SAS and SPSS macros. Psychological Methods
18(2), 137-150.

VanderWeele, T J. (2009). Marginal Structural Models for the Estimation of
Direct and Indirect Effects. Epidemiology 20(1), 18-26.

VanderWeele, T J. (2012). A three-way decomposition of a total effect into
direct, indirect, and interactive effects. Technical Report, Harvard University,
Boston.

VanderWeele, T J, Vansteelandt, S and Robins, J M. (2014). Effect
decomposition in the presence of an exposure-induced mediator-outcome
confounder. Epidemiology 25(2), 300-306.





Figure 1: DAG summarizing the data. Because treatment A is randomized, the
pre-treatment covariate C is not a cause of A in the DAG.



A Web-appendix A: Proofs 

Proof of Theorem 4.3: (4.3) is trivial for $M_{1,I=1} = M_0$. (4.3) also follows
immediately if $M_{1,I=1}$ is a random draw of $M_0$ given C. Furthermore, it is easy
to see that for $M_{1,I=1} = M_0$, and if all $Y_{a,m}$ are well-defined, then cross-worlds
Assumption (3.1) implies equation (4.4): for $M_{1,I=1} = M_0$, equation (4.4) states
$Y_{1,m} \mid M_0 = m, C = c \sim Y_{1,m} \mid M_1 = m, C = c$, and under equation (3.1) for
randomized treatment A, $Y_{1,m}$ depends, for given C, neither on $M_0$ nor on $M_1$.
For $M_{1,I=1}$ a random draw, equation (4.4) states $Y_{1,m} \mid M_{1,I=1} = m, C = c \sim
Y_{1,m} \mid M_1 = m, C = c$, and under equation (3.1) for randomized treatment A,
$Y_{1,m}$ depends, for given C, not on any mediators.

Proof of Theorem 5.2: First, let I be an intervention that is organic with respect
to C. Then

$$
\begin{aligned}
E(Y_{1,I=1}) &= E\left(E[Y_{1,I=1} \mid M_{1,I=1}, C]\right) \\
&= \int_{(c,m)} E[Y_{1,I=1} \mid M_{1,I=1} = m, C = c]\, f_{M_{1,I=1} \mid C = c}(m)\, dm\, f_C(c)\, dc \\
&= \int_{(c,m)} E[Y_1 \mid M_1 = m, C = c]\, f_{M_0 \mid C = c}(m)\, dm\, f_C(c)\, dc. \qquad (8)
\end{aligned}
$$

In this proof, the first two equalities follow from the definition of conditional
expectation. The third equality follows from Definition 4.1, (4.3) and (4.4). Thus,
the choice of I does not influence the direct and indirect effect, as long as it is
organic with respect to C.

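Next, let $I$ be an intervention that is organic with respect to $C$ and $\tilde{I}$ an
intervention that is organic with respect to $\tilde{C}$. I assumed that $\tilde{C}$ is not a common
cause of mediator and outcome given $C$, and $C$ is not a common cause of mediator
and outcome given $\tilde{C}$; hence there are 4 different cases, with either (5.5) or (5.6)
holding for $C$ and $\tilde{C}$, respectively. I will show that under any of the 4 different
cases,

$$E(Y_{1,I=1}) = \int_{(c,m,\tilde{c})} E[Y_1 \mid M_1 = m, C = c, \tilde{C} = \tilde{c}]\, f_{M_0 \mid C = c, \tilde{C} = \tilde{c}}(m)\, f_{C,\tilde{C}}(c, \tilde{c})\, d\tilde{c}\, dm\, dc.$$

Since the conditions and the result are symmetric in $C$, $\tilde{C}$, it follows that also

$$E(Y_{1,\tilde{I}=1}) = \int_{(c,m,\tilde{c})} E[Y_1 \mid M_1 = m, C = c, \tilde{C} = \tilde{c}]\, f_{M_0 \mid C = c, \tilde{C} = \tilde{c}}(m)\, f_{C,\tilde{C}}(c, \tilde{c})\, d\tilde{c}\, dm\, dc.$$

But then, $E(Y_{1,I=1}) = E(Y_{1,\tilde{I}=1})$.

Suppose first that $\tilde{C} \perp\!\!\!\perp M_0 \mid C$ and $\tilde{C} \perp\!\!\!\perp M_1 \mid C$. Then

$$
\begin{aligned}
E(Y_{1,I=1}) &= \int_{(c,m)} E[Y_1 \mid M_1 = m, C = c]\, f_{M_0 \mid C = c}(m)\, f_C(c)\, dm\, dc \\
&= \int_{(c,m)} \int_{\tilde{c}} E[Y_1 \mid M_1 = m, C = c, \tilde{C} = \tilde{c}]\, f_{\tilde{C} \mid M_1 = m, C = c}(\tilde{c})\, d\tilde{c}\; f_{M_0 \mid C = c}(m)\, f_C(c)\, dm\, dc \\
&= \int_{(c,m,\tilde{c})} E[Y_1 \mid M_1 = m, C = c, \tilde{C} = \tilde{c}]\, f_{\tilde{C} \mid C = c}(\tilde{c})\, f_{M_0 \mid C = c, \tilde{C} = \tilde{c}}(m)\, f_C(c)\, d\tilde{c}\, dm\, dc \\
&= \int_{(c,m,\tilde{c})} E[Y_1 \mid M_1 = m, C = c, \tilde{C} = \tilde{c}]\, f_{M_0 \mid C = c, \tilde{C} = \tilde{c}}(m)\, f_{C,\tilde{C}}(c, \tilde{c})\, d\tilde{c}\, dm\, dc.
\end{aligned}
$$

The first line follows from equation (8). The second line conditions on $\tilde{C}$. In the
third line I changed the order of integration and used $\tilde{C} \perp\!\!\!\perp M_0 \mid C$ and $\tilde{C} \perp\!\!\!\perp M_1 \mid C$.
Alternatively, suppose that $\tilde{C} \perp\!\!\!\perp Y_1 \mid M_1, C$. Then,

$$
\begin{aligned}
E(Y_{1,I=1}) &= \int_{(c,m)} E[Y_1 \mid M_1 = m, C = c]\, f_{M_0 \mid C = c}(m)\, f_C(c)\, dm\, dc \\
&= \int_{(c,m)} E[Y_1 \mid M_1 = m, C = c] \int_{\tilde{c}} f_{M_0 \mid C = c, \tilde{C} = \tilde{c}}(m)\, f_{\tilde{C} \mid C = c}(\tilde{c})\, d\tilde{c}\; f_C(c)\, dm\, dc \\
&= \int_{(c,m),\tilde{c}} E[Y_1 \mid M_1 = m, C = c]\, f_{M_0 \mid C = c, \tilde{C} = \tilde{c}}(m)\, f_{C,\tilde{C}}(c, \tilde{c})\, d\tilde{c}\, dm\, dc \\
&= \int_{(c,m),\tilde{c}} E[Y_1 \mid M_1 = m, C = c, \tilde{C} = \tilde{c}]\, f_{M_0 \mid C = c, \tilde{C} = \tilde{c}}(m)\, f_{C,\tilde{C}}(c, \tilde{c})\, d\tilde{c}\, dm\, dc.
\end{aligned}
$$

The first line follows from equation (8). The second line conditions on $\tilde{C}$. The
last line follows from $\tilde{C} \perp\!\!\!\perp Y_1 \mid M_1, C$. □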




















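Proof of Theorem 6.1:

$$
\begin{aligned}
E(Y_{1,I=1}) &= \int_{(c,m)} E[Y_1 \mid M_1 = m, C = c]\, f_{M_0 \mid C = c}(m)\, dm\, f_C(c)\, dc \\
&= \int_{(c,m)} E[Y_1 \mid M_1 = m, C = c, A = 1]\, f_{M_0 \mid C = c, A = 0}(m)\, dm\, f_C(c)\, dc \\
&= \int_{(c,m)} E[Y \mid M = m, C = c, A = 1]\, f_{M \mid C = c, A = 0}(m)\, f_C(c)\, dm\, dc.
\end{aligned}
$$

The first equality follows from equation (8). The second equality follows from the
fact that treatment was randomized; this implies that

$A \perp\!\!\!\perp (Y_1, M_1) \mid C$ and $A \perp\!\!\!\perp M_0 \mid C$.

The last equality follows from the randomization. □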
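
For a discrete mediator and discrete covariate, the last expression in the proof of Theorem 6.1 becomes a finite sum over cells, which can be estimated by replacing each factor with an empirical proportion or cell mean. The following Python sketch illustrates this; the function name `mediation_formula` and the eight-unit toy data set are hypothetical, and the direct and indirect effects are computed as $E(Y_{1,I=1}) - E(Y_0)$ and the remainder of the total effect, with $E(Y_1)$ and $E(Y_0)$ estimated by arm means under randomization.

```python
import numpy as np

def mediation_formula(A, M, C, Y):
    """Plug-in for sum over (c, m) of E[Y | M=m, C=c, A=1] P(M=m | C=c, A=0) P(C=c)."""
    total = 0.0
    for c in np.unique(C):
        p_c = np.mean(C == c)
        arm0 = (A == 0) & (C == c)
        if not arm0.any():
            raise ValueError("no A = 0 units with this value of C")
        for m in np.unique(M[arm0]):
            p_m = np.mean(M[arm0] == m)          # P(M = m | C = c, A = 0)
            cell = (A == 1) & (C == c) & (M == m)
            if not cell.any():
                raise ValueError("no A = 1 units in this (m, c) cell: positivity fails")
            total += Y[cell].mean() * p_m * p_c  # E[Y | M = m, C = c, A = 1] * weights
    return float(total)

# Toy data (hypothetical): a single covariate level, binary mediator and outcome.
A = np.array([0, 0, 0, 0, 1, 1, 1, 1])
M = np.array([0, 0, 0, 1, 0, 0, 1, 1])
C = np.zeros(8, dtype=int)
Y = np.array([0, 0, 0, 0, 0, 1, 1, 1])

e_y1_i = mediation_formula(A, M, C, Y)
direct = e_y1_i - float(Y[A == 0].mean())
indirect = float(Y[A == 1].mean()) - e_y1_i
print(e_y1_i, direct, indirect)  # 0.625 0.625 0.125
```

In the toy data, the A = 1 cell means are 0.5 (M = 0) and 1.0 (M = 1), and the A = 0 mediator distribution puts weight 0.75 and 0.25 on those cells, giving 0.5 × 0.75 + 1.0 × 0.25 = 0.625.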

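Proof of Theorem 7.3: Theorem 7.3 assumed that either equation (5.5) or equation
(5.6) holds for Z. Suppose first that equation (5.5) holds for Z. Then

$$
\begin{aligned}
E(Y_{1,I=1}) &= \int_{(c,m)} E[Y_1 \mid M_1 = m, C = c]\, f_{M_0 \mid C = c}(m)\, dm\, f_C(c)\, dc \\
&= \int_{(c,m)} \int_z E[Y_1 \mid M_1 = m, Z = z, C = c]\, f_{Z \mid M_1 = m, C = c}(z)\, dz\; f_{M_0 \mid C = c}(m)\, f_C(c)\, dm\, dc \\
&= \int_{(c,z,m)} E[Y_1 \mid M_1 = m, Z = z, C = c, A = 1]\, f_{Z \mid C = c}(z)\, f_{M_0 \mid Z = z, C = c}(m)\, f_C(c)\, dz\, dm\, dc \\
&= \int_{(c,z,m)} E[Y_1 \mid M_1 = m, Z = z, C = c, A = 1]\, f_{M_0 \mid Z = z, C = c, A = 0}(m)\, f_{C,Z}(c, z)\, dz\, dm\, dc \\
&= \int_{(c,z,m)} E[Y \mid M = m, Z = z, C = c, A = 1]\, f_{M \mid Z = z, C = c, A = 0}(m)\, f_{C,Z}(c, z)\, dm\, dz\, dc.
\end{aligned}
$$

The first line follows from equation (8). The second line conditions on Z. The
third line uses (5.5), for both $M_0$ and $M_1$, Assumption 7.2, and changes the order
of integration. The fourth line follows from Assumption 7.2. The last line follows
from Assumption 7.1.

Next, suppose that equation (5.6) holds for Z. Then

$$
\begin{aligned}
E(Y_{1,I=1}) &= \int_{(c,m)} E[Y_1 \mid M_1 = m, C = c]\, f_{M_0 \mid C = c}(m)\, dm\, f_C(c)\, dc \\
&= \int_{(c,m)} E[Y_1 \mid M_1 = m, C = c] \int_z f_{M_0 \mid Z = z, C = c}(m)\, f_{Z \mid C = c}(z)\, dz\, dm\, f_C(c)\, dc \\
&= \int_{(c,z,m)} E[Y_1 \mid M_1 = m, Z = z, C = c]\, f_{M_0 \mid Z = z, C = c}(m)\, f_{Z,C}(z, c)\, dm\, dz\, dc \\
&= \int_{(c,z,m)} E[Y_1 \mid M_1 = m, Z = z, C = c, A = 1]\, f_{M_0 \mid Z = z, C = c, A = 0}(m)\, f_{Z,C}(z, c)\, dm\, dz\, dc \\
&= \int_{(c,z,m)} E[Y \mid M = m, Z = z, C = c, A = 1]\, f_{M \mid Z = z, C = c, A = 0}(m)\, f_{Z,C}(z, c)\, dm\, dz\, dc.
\end{aligned}
$$

The first line follows from equation (8). The second line follows by conditioning
on Z. The third line follows from (5.6) and changing the order of integration. The
fourth line follows from Assumption 7.2. The fifth line follows from Assumption
7.1. □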


B Web-appendix B: The smoking-and-low-birth-weight 
paradox 

Section 4 argued that if a common cause of mediator and outcome C has not been
observed, it is often not reasonable to think that equation (4.4) without C would
hold. As an example, this appendix considers the case of maternal smoking and
infant mortality. The effect of smoking during pregnancy (A = 1) on infant mortality
may be mediated by low birth weight. It turns out that a naive analysis leads
to the conclusion that the direct effect of maternal smoking on infant mortality is
beneficial. Hernandez-Diaz and others (2006) explain this “birth weight paradox”
and provide an explanation for the possible biases. This appendix shows how this
relates to the setup of this article.

For simplicity of exposition, assume that whether a pregnant woman smokes or
not is unrelated to her prognosis with respect to low birth weight or complications
in her infant in the “smoking” and “not smoking” scenarios. So, differences in
outcomes between smokers and nonsmokers are caused by smoking only, effectively
implying that the treatment “smoking” can be considered randomized. In
practice, this may be violated if women with other unhealthy behaviors besides
smoking are more likely to smoke. Those complications are ignored here, because
the issues addressed in this appendix are present even under randomized
treatment, and relaxing the randomization assumption was already discussed in
Section 7.

Some infants may have a low birth weight due to genetically determined birth
defects, which are likely not caused by smoking, or due to environmental causes
other than smoking like malnutrition. These causes may be more predictive of
infant mortality than smoking (Hernandez-Diaz and others, 2006). For simplicity
of exposition, this appendix bases the discussion on genetically determined birth
defects as common causes of birth weight and infant mortality. Denote these by
C. Suppose that, as in most studies, C is not observed. Now consider an intervention
I (Definition 4.1 equation (4.3)) which causes birth weight for the smoking
mothers to have the same distribution as the birth weight for non-smoking mothers,
without changing genetically determined birth defects C. Then, the prognosis
of an infant had the mother smoked and “had the infant had a normal birth weight
$M_{1,I=1}$ under intervention I” is most likely not the same as the prognosis of an
infant had the mother smoked and “had the infant had normal birth weight $M_1$
without intervention”. Without the intervention, in an infant of a smoking mother
with normal birth weight $M_1$, genes responsible for birth defects are most likely
more favorable: the birth weight was normal without intervention, even while
the mother was smoking. So, one would think that the prognosis $Y_1$ is good for
such an infant. Under intervention I, some of the infants of smoking women with
normal birth weight $M_{1,I=1}$ will have genetically determined birth defects: the
birth weight has been intervened on to be normal without changing genetically
determined birth defects. The possibility of genetically determined birth defects
would lead to a worse prognosis $Y_{1,I=1}$ for such infants. Thus, equation (4.4) will
generally not hold in this situation.

Next, I consider how this issue affects the estimators of the direct and indirect
effects if C is ignored (which it has to be, because it is assumed that C is
unobserved). Let the outcome Y be an indicator of infant mortality, and let I be
an intervention for which equation (4.3) holds. If C is ignored, $E(Y_{1,I=1})$ would
be estimated using the data for women who smoked but who had infants with
relatively high birth weights, because that is the distribution of the birth weights
$M_{1,I=1}$ under intervention I. As argued in the previous paragraph, this approach
is too optimistic, and thus the mortality probability $E(Y_{1,I=1})$ is underestimated.
Thus, the part of the effect of smoking that is mediated through low birth weight,
the indirect effect of smoking, is overestimated. As a consequence, the direct
effect of smoking on infant mortality is underestimated.

This is in line with what was found in e.g. Hernandez-Diaz and others (2006),
who studied controlled direct effects, and found that conditional on birth weight,
smoking and infant mortality were negatively associated in infants with low birth
weight. A naive approach would thus conclude that the direct effect of smoking
is beneficial. Hernandez-Diaz and others (2006) explained this by noting that low
birth weight may be more harmful if caused by genetic birth defects than if caused
by smoking. As outlined above, this is a violation of equation (4.4).
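
The direction of the bias can be checked in a small numerical example. The sketch below uses hypothetical probabilities, chosen only for illustration, for a binary birth defect C, randomized smoking A, low birth weight M and infant mortality Y. It computes the mediation formula once with adjustment for C and once ignoring C; all names and numbers are assumptions of this sketch rather than estimates from any study.

```python
P_C = [0.9, 0.1]                # P(C = c); C = 1: genetically determined birth defect
P_LOW = [[0.05, 0.60],          # P(M = 1 | A = a, C = c): rows a, columns c
         [0.15, 0.70]]          # M = 1: low birth weight; A = 1: smoking

def p_m(m, a, c):
    """P(M = m | A = a, C = c)."""
    return P_LOW[a][c] if m == 1 else 1.0 - P_LOW[a][c]

def p_y(a, m, c):
    """Mortality risk P(Y = 1 | A = a, M = m, C = c); smoking adds 0.005 directly."""
    return 0.005 + 0.005 * a + 0.02 * m + 0.2 * c

def p_m_marginal(m, a):
    """P(M = m | A = a), averaging over C (A is randomized, so C is independent of A)."""
    return sum(p_m(m, a, c) * P_C[c] for c in (0, 1))

def p_y_ignoring_c(a, m):
    """E[Y | M = m, A = a], the outcome regression when C is unobserved."""
    return sum(p_y(a, m, c) * p_m(m, a, c) * P_C[c] for c in (0, 1)) / p_m_marginal(m, a)

cells = [(c, m) for c in (0, 1) for m in (0, 1)]
mean_y0 = sum(p_y(0, m, c) * p_m(m, 0, c) * P_C[c] for c, m in cells)
# Mediation formula with C included: E(Y_{1,I=1}) for an intervention organic w.r.t. C.
adjusted = sum(p_y(1, m, c) * p_m(m, 0, c) * P_C[c] for c, m in cells)
# Mediation formula with C ignored.
naive = sum(p_y_ignoring_c(1, m) * p_m_marginal(m, 0) for m in (0, 1))

direct_adjusted = adjusted - mean_y0
direct_naive = naive - mean_y0
print(adjusted, naive)
print(direct_adjusted, direct_naive)
```

With these numbers the version that ignores C gives a smaller value for $E(Y_{1,I=1})$ than the adjusted version, and the resulting naive direct effect of smoking is negative even though smoking raises mortality directly by 0.005 in every (m, c) cell.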

The solution to this issue is to try to include in C as many pre-treatment common
causes of mediator and outcome as feasible. In the case of the genetically
determined birth defects in the above example, this could perhaps be done through
observed traits of the newborn babies. If this is unfeasible, conclusions may be
flawed because equation (4.4) fails to hold. The direction of the bias can be reasoned
as described in the previous paragraph: in this example, ignoring birth defects
results in an overestimation of the organic indirect effect of smoking (mediated
by birth weight), and an underestimation of the detrimental organic direct
effect of smoking (not mediated by birth weight).

The discussion in this section illustrates the importance of the assumptions
behind mediation analysis. One can compare whether the distribution on the left
hand side of equation (4.4) puts more mass on larger values of the outcome or on
smaller values of the outcome as compared to the distribution on the right hand
side. Thus, an advantage of the current approach is that the direction of the bias
that results from lack of validity of equation (4.4) can be discussed in the context
of each particular application.


C Web-appendix C: Interventions on the mediator
under treatment or on the mediator under no treatment?


There has been some discussion in the previous literature about whether one
should consider setting the mediator to its value under treatment versus setting
it to its value without treatment (see e.g. VanderWeele, 2009). As indicated in
Section 8, the approach in this article can easily be extended to incorporate both.
To illustrate what might be of most clinical interest in a particular setting, consider
two scenarios. In scenario 1, an alternative treatment $I'$ changes the mediator the
same way conventional treatment A does, without having a direct effect on the
outcome. This would be especially relevant for example if the direct effect of
treatment A is a harmful side effect. In this case, one would want to compare
the distribution of the outcome under no treatment with the distribution of the
outcome under no treatment if the mediator under intervention $I'$, $M_{0,I'=1}$, has
the same distribution as $M_1$. For example, with $Y_{0,I'=1}$ the outcome under $I'$ with
A = 0, one would want to estimate $E(Y_{0,I'=1} - Y_0)$ as the effect mediated through
M. This is a different quantity than the organic indirect effect of Section 4, but
can be estimated in a similar way by changing the coding of A as in Section 8. In
scenario 2, the quantity of interest is the effect of an alternative treatment $\tilde{A}$, where
$\tilde{A}$ has the same direct effect as treatment A, but does not affect the mediator M.
This would be especially relevant if the effect of treatment A on the mediator is a
harmful side effect. In this case one would want to consider an intervention such
that $M_{1,I=1}$ under treatment has the same distribution as $M_0$. In that situation, the
quantity of interest is $E(Y_{1,I=1} - Y_0)$, the organic direct effect of treatment A = 1
as defined in Definition 4.1. Scenario 1 motivates an intervention that causes the
mediator without treatment to have the same distribution as $M_1$; scenario 2 motivates
an intervention that causes the mediator with treatment to have the same
distribution as $M_0$. When studying the biological mechanisms by which particular
treatments are effective, both types of interventions may be of interest.


D Web-appendix D: Inference under randomized treatment

I now illustrate how one might use the identification result of Section 6 to estimate
$E(Y_{1,I=1})$, and hence the organic indirect and direct effects, under semiparametric
assumptions.

Suppose that $M_1 \sim M_0 + \beta_1 + \beta_2^\top C \mid C$, with $\beta_1 \in \mathbb{R}$ and $\beta_2$ a vector of the
same dimension as C. This would be the case if, as in e.g. Valeri and VanderWeele (2013),
M follows a regression model $M = \beta_0 + \beta_1 A + \beta_2^\top C + \beta_3^\top AC + \epsilon$, where the random variable
$\epsilon$ has the same distribution given C under treatment as without treatment, and
with $\beta_1 \in \mathbb{R}$ and $\beta_2, \beta_3$ vectors of the same dimension as C (in this regression the shift
between $M_1$ and $M_0$ is $\beta_1 + \beta_3^\top C$, so the coefficient of $AC$ plays the role of $\beta_2$
in the shift above). Suppose in addition that the expected value of
Y given C and M under treatment follows some parametric model of the form
$E[Y \mid M = m, C = c, A = 1] = f_\theta(m, c)$. Notice that this last model applies
only to the distribution of Y conditional on A = 1, not conditional on A = 0. This
implies that the model does not restrict treatment-mediator interactions. Then,
Theorem 6.1 implies $E(Y_{1,I=1}) = E[f_\theta(M - \beta_1 - \beta_2^\top C, C) \mid A = 1]$ (proof: see below).
This can be estimated
by fitting the models for $\beta$ and $\theta$ using standard methods, plugging the parameter
estimates in, and replacing the expectation given A = 1 by its empirical average.
Standard errors can be estimated with the bootstrap.

Notice that the resulting estimator uses changes in the distribution of the mediator
with and without treatment, but the distribution of the outcome only in treated
units. This leads to an estimator for the indirect effect that does not use data on
the outcomes for untreated units.
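
The estimation recipe (fit the mediator and outcome models, plug in, average over the A = 1 arm, bootstrap for standard errors) can be sketched as follows. The sketch assumes a scalar covariate C and, purely for illustration, a linear outcome model $f_\theta(m, c) = \theta_0 + \theta_1 m + \theta_2 c$ fitted by least squares; the function names and the simulated data are hypothetical and not taken from the article or from any existing package.

```python
import numpy as np

def estimate(A, M, C, Y):
    """Plug-in estimate of E(Y_{1,I=1}) with a linear outcome model (scalar C)."""
    # Mediator regression on (1, A, C, A*C); the shift M_1 - M_0 is coef[1] + coef[3] * C.
    design = np.column_stack([np.ones_like(M), A, C, A * C])
    coef = np.linalg.lstsq(design, M, rcond=None)[0]
    shift = coef[1] + coef[3] * C
    # Outcome regression fitted in the A = 1 arm only.
    arm1 = A == 1
    X1 = np.column_stack([np.ones(arm1.sum()), M[arm1], C[arm1]])
    theta = np.linalg.lstsq(X1, Y[arm1], rcond=None)[0]
    # Empirical average of f_theta(M - shift, C) over the A = 1 arm.
    return float(np.mean(theta[0] + theta[1] * (M[arm1] - shift[arm1]) + theta[2] * C[arm1]))

def bootstrap_se(A, M, C, Y, n_boot=200, seed=0):
    """Nonparametric bootstrap standard error: resample units, refit both models."""
    rng = np.random.default_rng(seed)
    n = len(Y)
    replicates = []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)
        replicates.append(estimate(A[idx], M[idx], C[idx], Y[idx]))
    return float(np.std(replicates, ddof=1))

# Hypothetical simulated trial, used only to exercise the functions.
rng = np.random.default_rng(42)
n = 4000
C = rng.normal(size=n)
A = rng.integers(0, 2, size=n)
M = 0.5 + 1.0 * A + 0.2 * C + 0.3 * A * C + rng.normal(size=n)
Y = 2.0 + 1.5 * M + 0.5 * C + 1.0 * A + rng.normal(size=n)

est = estimate(A, M, C, Y)
se = bootstrap_se(A, M, C, Y)
print(est, se)
```

Resampling whole units and refitting both regressions in each replicate lets the standard error reflect the uncertainty in the estimated shift as well as in the outcome model. In this simulation the target value is 2 + 1 + 1.5 × 0.5 = 3.75.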


Valeri and VanderWeele (2013) provide code to estimate direct and indirect
effects based on the mediation formula for the case where M and Y both follow
regression or logistic regression models.

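Proof of inference under randomized treatment:

$$
\begin{aligned}
E(Y_{1,I=1}) &= \int_{(c,m)} E[Y \mid M = m, C = c, A = 1]\, f_{M \mid C = c, A = 0}(m)\, f_C(c)\, dm\, dc \\
&= \int_{(c,m)} f_\theta(m, c)\, f_{M \mid C = c, A = 1}(m + \beta_1 + \beta_2^\top c)\, f_C(c)\, dm\, dc \\
&= \int_{(c,\tilde{m})} f_\theta(\tilde{m} - \beta_1 - \beta_2^\top c, c)\, f_{M \mid C = c, A = 1}(\tilde{m})\, f_C(c)\, d\tilde{m}\, dc \\
&= E\left[f_\theta(M - \beta_1 - \beta_2^\top C, C) \mid A = 1\right],
\end{aligned}
$$

where the first equality follows from Theorem 6.1, the second equality follows
from $M_1 \sim M_0 + \beta_1 + \beta_2^\top C \mid C$, see above, the third equality from a change
of variables with $\tilde{m} = m + \beta_1 + \beta_2^\top c$, and the fourth equality from the fact that
treatment A is randomized, and therefore the distribution of C does not depend on
A. □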


E Web-appendix E: Organic direct and indirect effects:
independence assumptions instead of distributional
assumptions

Some readers may be more at ease with independence assumptions underlying
causal inference than with the distributional assumptions considered in the main
text. Such assumptions can be formulated in the current context as follows. Let R
describe the possible treatments: R = 0: treatment 0, R = 1: treatment 1 and R = 2:
treatment 1 combined with an “organic” intervention I on the mediator. Equivalent
to the definition in the main text, the definition for I being an organic intervention
on the mediator could be formulated as that both equations (9) and (10)
are satisfied:

$M \perp\!\!\!\perp R \mid C = c, R \neq 1$ (9)

$Y \perp\!\!\!\perp R \mid M = m, C = c, R \neq 0.$ (10)

Of course, for easier interpretation, $R \neq 1$ could be replaced by “R = 0 or R = 2”
and $R \neq 0$ could be replaced by “R = 1 or R = 2”. The first of these assumptions
states that, for given pre-treatment covariates C, the mediator is independent of
whether the mediator was intervened on during treatment versus no treatment was
given. The second of these assumptions states that, for given mediator and pre-treatment
covariates C, the outcome is independent of whether the mediator got
its value m because it was intervened on during treatment versus treatment 1 was
given.


F Web-appendix F: Organic direct and indirect effects without counterfactuals


Some of the literature on causal inference avoids counterfactuals, see e.g. Dawid
(1979), Didelez and others (2006), and Geneletti (2007). Although this has not
been a concern in the main manuscript, some readers may appreciate that organic
direct and indirect effects can also be defined without counterfactuals, if “organic”
interventions are possible in a three-arm clinical trial with $R = 0$: treatment 0,
$R = 1$: treatment 1, and $R = 2$: treatment 1 combined with an “organic”
intervention $I$ on the mediator. In this setting, the definition for $I$ being an organic
intervention on the mediator is that both equations (11) and (12) are satisfied:

\[ M \mid R = 2, C = c \ \sim\ M \mid R = 0, C = c \qquad (11) \]

\[ Y \mid R = 2, M = m, C = c \ \sim\ Y \mid R = 1, M = m, C = c. \qquad (12) \]

Equation (11) states that the distribution of the mediator under treatment
combined with the intervention $I$ is as under treatment 0, and equation (12) intuitively
states that the intervention $I$ on the mediator has no direct effect on the outcome
$Y$.


The organic indirect and direct effects based on $I$ can now be defined as

\[ E[Y \mid R = 1] - E[Y \mid R = 2] \]

and

\[ E[Y \mid R = 2] - E[Y \mid R = 0], \]

respectively.

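As in the main paper, the mediation formula holds for $E[Y \mid R = 2]$ because

\[
\begin{aligned}
E[Y \mid R = 2]
&= E\left( E[Y \mid M, C, R = 2] \mid R = 2 \right) \\
&= \int_{(c,m)} E[Y \mid M = m, C = c, R = 2]\, f_{M \mid C=c, R=2}(m)\, f_{C \mid R=2}(c)\, dm\, dc \\
&= \int_{(c,m)} E[Y \mid M = m, C = c, R = 1]\, f_{M \mid C=c, R=0}(m)\, f_C(c)\, dm\, dc,
\end{aligned}
\]

where the last equality holds because $R$ is randomized, (11), and (12).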
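
The three-arm construction can be illustrated with simulated data. The sketch below is hypothetical and not from the paper: it assumes a binary mediator and a binary covariate, uses made-up probabilities and coefficients, and builds arm $R = 2$ so that (11) and (12) hold by design. It then compares the mean outcome in arm 2 with the mediation formula computed from arms 0 and 1 only. The labelling of the two contrasts (indirect for arm 1 versus arm 2, direct for arm 2 versus arm 0) is an assumption of this sketch.

```python
import numpy as np


def simulate_three_arm(n, seed=0):
    """Simulate a three-arm trial in which arm 2 satisfies (11) and (12)."""
    rng = np.random.default_rng(seed)
    C = rng.binomial(1, 0.4, size=n)
    R = rng.integers(0, 3, size=n)          # randomized arm: 0, 1 or 2
    # Mediator law: arm 1 follows the treated law; arms 0 and 2 follow
    # the untreated law, which is equation (11) for arm 2.
    p_m = np.where(R == 1, 0.6 + 0.2 * C, 0.2 + 0.2 * C)
    M = rng.binomial(1, p_m)
    # Outcome law given (M, C): arms 1 and 2 share the treated law,
    # which is equation (12) for arm 2.
    mean_y = np.where(R >= 1,
                      1.0 + 2.0 * M + 0.5 * C + 1.0 * M * C,
                      0.5 * M + 0.5 * C)
    Y = mean_y + rng.normal(size=n)
    return R, M, C, Y


def mediation_formula(R, M, C, Y):
    """Sum over (c, m) of E[Y|M=m,C=c,R=1] * P(M=m|C=c,R=0) * P(C=c)."""
    total = 0.0
    for c in (0, 1):
        p_c = np.mean(C == c)
        for m in (0, 1):
            p_m = np.mean(M[(R == 0) & (C == c)] == m)
            ey = Y[(R == 1) & (C == c) & (M == m)].mean()
            total += ey * p_m * p_c
    return total


def arm_contrasts(R, Y):
    """Contrasts of arm means (labels are an assumption of this sketch)."""
    ey = [Y[R == r].mean() for r in (0, 1, 2)]
    return {"indirect": ey[1] - ey[2], "direct": ey[2] - ey[0]}
```

With a large simulated sample, `mediation_formula(R, M, C, Y)` and `Y[R == 2].mean()` should agree up to sampling error, even though the former never uses data from arm 2.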




